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1 Introduction and outline 



The Einstein-Hilbert action for general relativity (GR) is 



d'^x \/—gR. 



(1.1) 



When varying this action, one finds surface contributions that must vanish if 
the action is to be stationary. The surface contributions contain the metric 
variation 5gfj,i, and variations of the derivatives of the metric S^dagfiu). Setting 
SOfiu = on the boundary is not sufficient to kill all the surface contributions. 
Fixing both the metric and the derivatives of the metric on the boundary 
is uncomfortable, however it may not be initially obvious why this is so, or 
that there exists a unique and correct prescription for dealing with it. 

Gibbons, Hawking, and York proposed adding the trace of the extrinsic 
curvature of the boundary, K, to the action [H [2] . With this modification 
the action takes the form 



h is the determinant of the induced metric on the boundary. This modifica- 
tion is appealing because the variation of the Gibbons-Hawking- York (GHY) 
boundary term cancels the terms involving 5{dag^u), and so setting 5gfj_y = 
becomes sufficient to make the action stationary. This has been widely ac- 
cepted as the correct modification to the action. 

This modification raises some questions. Is the modification unique? Is 
it necessarily incorrect to require 5{dag^v) = 0? If this is incorrect, why is it 
appropriate to require the fixing of all components of g^j^^, when the graviton 
has only two degrees of freedom? More generally, what criteria determine 
which quantities to fix on the boundary? Should it be related to the number 
of degrees of freedom in the theory? 

Despite these questions the GHY term is desirable, as it possesses a num- 
ber of other key features. The term is required to ensure the path integral 
has the correct composition properties [1]. When passing to the hamiltonian 
formalism, it is necessary to include the GHY term in order to reproduce the 
correct Arnowitt-Deser-Misner (ADM) energy |3j. When calculating black 




(1.2) 
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hole entropy using the euchdean semiclassical approach, the entire contribu- 
tion comes from the GHY term [H]. These considerations underscore the 
necessity of a complete understanding of boundary terms. 

Higher derivative theories of gravity have attracted attention recently, 
mostly as modifications to GR that have the potential to explain cosmic 
acceleration without vacuum energy. A simple example of such a modification 
is F{R) gravity, where the action becomes some arbitrary function, F, of the 
Ricci scalar (for reviews, see IHE]), 



It is well known that this theory is dynamically equivalent to a scalar-tensor 
theory [251 EHl EH EHl EO] . By extending this equivalence to the GHY term, 
we find that the F{R) action is left with a boundary term. 



This boundary term must be present if the correspondence to scalar-tensor 
theory is to hold at the boundary. 

This term has been arrived at before, both directly and indirectly, and in 
several different contexts [ini HSl HZl [IHl [IS] • There has been some confusion, 
as this boundary term does not allow S{dug^u) to remain arbitrary on the 
boundary. Various ways around this have been attempted, for example, when 
restricting to maximally symmetric backgrounds a boundary term for F{R) 
theory can be found that allows 5{dag^v) to remain arbitrary on the boundary 



F{R) theory is a higher derivative theory, however, and the derivatives of 
the metric encode true degrees of freedom. 5{dcjg^y) should not remain arbi- 
trary on the boundary. Instead, 5{dc,g^u) must be subject to the constraint 
that the variation of the four- dimensional Ricci scalar be held fixed on the 
boundary. This corresponds to holding the scalar field fixed in the equivalent 
scalar-tensor theory. We will show that the above boundary term reproduces 
the expected ADM energy upon passing to the hamiltonian formalism, and 
the expected entropy for black holes. 

In what follows, we will elaborate upon the above in detail. 




(1.3) 




(1.4) 
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An outline of this paper is as follows. In section |2| we go over the criteria 
that are necessary to have a well-posed variational principle. In section |3} 
we examine several toy examples of lagrangians in classical mechanics that 
share some of the features of the more complicated GR case. We discuss 
what adding total derivatives can do to the variational principle, and what 
happens in higher derivative theories. We also discuss the complications due 
to constraints and gauge invariance, and we work out the case of electro- 
magnetism. In section |4| we review the GHY term, its variation, and how it 
renders the GR action well-posed. In section [5} we discuss higher derivative 
modified gravity theories, scalar-tensor theories, and their equivalence, using 
F{R) theory as the prime example. We find the boundary terms for F{R) 
theory using this equivalence. In section [6} we derive the hamiltonian formu- 
lation of scalar-tensor theory and F{R) theory, keeping all boundary terms, 
and show how the boundary term is essential for obtaining the ADM energy. 
In section [7| we calculate the entropy of a Schwartzschild black hole in F{R) 
theory, using the euclidean semi-classical approach, and compare the result 
to the Wald entropy formula. We conclude in section |8} and some formulae, 
theorems and technicalities are relegated to the appendices. 

Conventions: We use the conventions of Carroll [20] • These include us- 
ing the mostly plus metric signature (—,+,+,+,...), and the following defi- 
nitions of the Riemann and Ricci tensors, R'^x^u = dpT^x ~ duT'^^ + ^'^ri^lx " 
^uri^]ix^ = R^Xfiu- The weight for anti-symmetrization and symmetriza- 
tion is, e.g. A^^^^ = | {A^^, — A^^). For spacelike hyper surf aces, the normal 
vector will always be inward pointing. For timelike hypersurfaces, it will be 
outward pointing. The dimension of spacetime is n. Further conventions and 
notation for foliations of spacetime are laid out in appendices |A] and [B] 



2 What makes a variational principle well- 
posed? 

A traditional approach to field theory is to integrate by parts at will, ignor- 
ing boundary contributions. The reasoning is that if there are no physical 
boundaries in the space under consideration, or if they are so far away that 
their effects can be expected not to interfere with the system under study, 
then they can be ignored. 
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If one is interested only in the local equations of motion of a theory, this 
is a valid approach. In this case, one uses the action only as a formal device 
for arriving at the equations of motion. Denoting the fields collectively by 
(f)\ one writes down the lagrangian, which is a local density, 

cm,x). (2.1) 

Here [cp] stands for dependence on 0* and any of its higher derivatives, [(/)] = 
(f)\ 9^0*, d^du(f)\ . . ., and x are the coordinates on spacetime. One then defines 
the equations of motion to be the Euler-Lagrange equations 

where 



"^M "-I/O ii\ ' '-'u'-'i' 



is the Euler-Lagrange derivative and the symmetric derivative is defined by 

(The symmetric derivative is to avoid over-counting multiple derivatives 
which are not independent.) 

All the relevant local theory, i.e. equations of motion, symmetries, con- 
served currents, gauge symmetries, Noether identities, etc., can be developed 
in terms of the lagrangian density alone, without ever writing an integral and 
without consideration of boundary contributions. For example, Noether's 
theorem can be stated as: an infinitesimal transformation 50*([0],x) is a 
symmetry if the lagrangian changes by a total derivative, 

?>C = a^A;^ (2.4) 

from which one can show directly that the following current is conserved on 
shell 

f = -/^^ . E • • • ^.^-M E(-i)'-^.^ • • • ^.^-. a(a.a.....v.,r 

(2.5) 
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(This reduces to = '^0*g(f^ ~ in the usual case where the lagrangian 
depends at most on first derivatives of the fields.)]^ 

Without specifying boundary conditions, the Euler-Lagrange equations 



(2.2) typically have many solutions (if there are no solutions, the lagrangian 
is said to be inconsistent). To select out the true solution, boundary condi- 
tions must be set. There should be some class of boundary conditions that 
render the system well-posed. A class of boundary conditions is well-posed 
if, given any choice of boundary conditions from this class, there exists a 
unique solution to the system compatible with that choice. The equations of 
motion are typically hyperbolic, so the class will generally involve all possi- 
ble choices of initial conditions and velocities for the fields, and all possible 
choices of spatial boundary conditions at all times. There may be several 
different classes of well-posed data, but each class will involve specifying the 
same number of boundary data. This number is the number of degrees of 
freedom in the theory. 

The choice of spatial boundary conditions generally corresponds to a 
choice of "vacuum state" for the theory. For example, in GR, if spatial bound- 
ary conditions (fall-off behavior for the fields, in this case) are chosen so that 
the spacetime is asymptotically flat, we find ourselves in the asymptotically 
flat vacuum of the theory. A choice of initial condition generally corresponds 
to a choice of state within the vacuum. For example, both Minkowski space 
and the Schwartzschild black hole have the same spatial boundary behavior, 
so they are both in the asymptotically fiat vacuum of GR, but they have 
different initial data, and so they represent different states. Thus, from the 
point of view that the action is just a formal device to arrive at the local 
equations of motion, information about the possible vacua of the theory, and 
the space of states in each vacuum, is not encoded directly in the action, 
only indirectly through the boundary conditions required of the equations of 
motion. 

Ideally, the action should do more. We want to say that the true field 

^If one wishes, still keeping with this line of thought, the action integral can be intro- 
duced as a formal device, 

J d"a;/:([0],x). (2.6) 

The integration region need not be specified, and the equations of motion are obtained by 
setting SS = and integrating by parts ignoring all boundary contributions. 
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Figure 1: Region over which a field theory is defined. 



configuration, among all field configurations with given boundary data, ex- 
tremizes the action. Consider a spacetime region, V, such as that in figure 
[1} The boundary of this region consists of a timelike part B, (which may be 
at infinity), which we will refer to as the spatial boundary, and two space- 
like edges. El and E2, which we will refer to as the endpoints. We consider 
configurations of the fields 0* in V that have some given boundary values, 
0*(i3), as well as initial and final values, 0*(Si), 0*(S2) (possibly involving 
derivatives of the fields or other combinations), chosen from a class which is 
well-posed, meaning that for each choice in the class there exists a unique 
solution to the equations of motion]^ 

The action is a functional of field configurations in V, 



^ Well-posedness of hyperbolic equations usually requires initial data, such as the values 
of 0* and its time derivatives at Si, rather than endpoint data such as the value of 0* at 
El and S2- The action naturally wants endpoint data, however. As we will elaborate on 
in the next section, endpoint data is usually just as well posed as initial data, with the 
exception of "unfortunate choices" which are measure zero in the space of all choices. 




(2.7) 




(2.8) 
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The criteria for a well-posed action should be the following. // we restrict 
the action to those field configurations consistent with our choice of boundary 
and initial data, 



the unique solution to the Euler- Lagrange equations should be the only ex- 
tremum of the action. It is important that the space of paths over which we 
vary be all paths that have the given boundary data. 

Of course, a necessary condition for the action to be stationary is that 
the field satisfy the Euler-Lagrange equations 



For the action to be truly stationary, any boundary contributions arising 
from the variation must vanish. No conditions other than those implicit 
in the class of well-posed data may be used in checking whether boundary 
contributions vanish. 

There will be many possible choices of the boundary data (corre- 
sponding to different choices of vacuum) and endpoint data 0*(Si), 0*(E2) 
(corresponding to different choices of the state) within a well-posed class. 
Each such choice leads to a different set of paths to consider in the action, 
and a well-posed action will give a unique stationary field configuration for 
each. The action, when evaluated on the classical path 0*, is thus a well 
defined functional on the class of possible boundary data, 



The number of physical degrees of freedom in the theory is the number of 
free choices of (f)'^{dV) (modulo gauge invariance, see later), i.e. the size of 
the class of well-posed boundary data. 

These criteria are essential to setting up a Hamilton- Jacobi theory, where 
the stationary value of the action, as a function of endpoint data, gives the 
canonical transformation that solves the equations of motion These 




(2.9) 




(2.11) 
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criteria are also essential to setting up a quantum theory based on the path 
integral, p. 

We will now turn to some examples. We will look at some actions which 
are well-posed in the sense we have just described, and some which are not. 
We will see that adding boundary terms or, equivalently, total derivatives 
to the action affects whether the action can be well-posed. Second class 
constraints and gauge freedom complicate the issue, and we will have more 
to say about these as well. 



3 Some Toy Examples 

We start with some simple examples from point particle mechanics. These are 
one-dimensional field theories, with fields g*(t) that depend on one parameter, 
the time t. The issue of spatial boundary data does not arise in this case; 
it is only the endpoint data, and g*(t2) that we need to worry about 

The number of degrees of freedom in the theory is half the number of free 
choices that may be made in specifying endpoint data. 



3.1 Standard classical mechanics 

The best example of a well-defined variational principle is an ordinary uncon- 
strained lagrangian system for n variables q\t), i = 1, . . . ,n without higher 
derivatives. 



The action is a function of the endpoint data q\, and the endpoint times 
ti,t2- The set of paths over which we vary is all satisfying = q\ 

In this sense, one dimensional theories do not possess different vacua, only a space of 
states, given by the various values the endpoint data are allowed to take. 




(3.1) 




(3.2) 
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and g*(t2) = 92- Varying the action we have 



SS 



t2 



dt 



dL 



d^dL_ 
dt dq" 



t2 



tl 



(3.3) 



The boundary variation vanishes since the are fixed there, so the nec- 
essary and sufficient condition for the action to be stationary is that the 
Euler-Lagrange equations of motion hold, 



dL d dL _ 
dq^ dt d(f 

or, upon expanding out the time derivative. 



0, 



(3.4) 



d^L 
d(fdq- 



dL 



d'^L 



dq^ dq^dq^ 



(3.5) 



Because of (3.2), we can solve algebraically for the highest derivative in 



terms of the lower derivatives, and we have a well formed initial value prob- 
lem: given a choice from the class of data g*(ti) and q^{ti), we will have a 
unique solution to the equations of motion. 

However, the action principle is not well suited to the initial value class. 
It is suited to the class where we choose endpoint data, i.e. where we fix 
on the two endpoints but do not specify the g* anywhere. For a second order 



differential equation such as (3.5), the endpoint problem will be well posed 
for all but a few "unfortunate" choices of the endpoint data (borrowing the 
language of [23]). An example of such an unfortunate choice is the following. 
Take the action of a simple harmonic oscillator in one dimension, with unit 
mass and angular frequency to, S = j'l^ dt ^ (g^ — tu^g^). The equation of 
motion is g + uo'^q = 0. If the time interval is chosen to be a half integer 
multiple of the period of the oscillator, t2 — ti = , n = 1,2, . . and we 
choose gi = g2 = 0, then there will be an inffnite number of distinct solutions 
to the equations of motion compatible with the endpoint data. This reffects 
the fact that the period of oscillations is independent of the amplitude. It 
is not hard to think of other such examples of "unfortunate" choices. The 
problem is that the equations of motion want initial conditions, but the action 
principle wants endpoint conditions, and the result is a failure of the action 
principle to be well deffned for all choices. 
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These unfortunate choices will make up a set of measure zero among all 
possible choices for the endpoint data. This is because finding degeneracies in 
the initial value formulation is equivalent to an eigenvalue problem over the 
interval t2 — ti and the eigenvalues are discrete. We will still consider such 
a variational principle well-posed. With this caveat, the action principle 
above, using the class of endpoint data, is well-posed, because for almost 
any choice of endpoint data, q\,q2,ti,t2, there is a unique solution to the 
equation of motion, corresponding to the unique path which extremizes the 
action. The action, evaluated on this path, becomes a well defined functional 
of q{,q2,ti, t2 almost everywhere, and so the number degrees of freedom (i.e. 
the size of the class of data), or the possibility of differentiating the action 
with respect to endpoint data is not affected. There are 2n free choices 
among the field endpoint values, so the model has n degrees of freedom. 

Going to the hamiltonian poses no problem. We introduce fields Pi{t), 

V. = (3.6) 



which by virtue of (3.2) can be inverted to solve for = q^{p,q)- We then 
write the action ^ 

S= f'dtpiq'-H, (3.7) 
Jti 

where 

Hip,q)=ip.<r-L)\^^^,.^^^^. (3.8) 

The variation with respect to the p^'s is done without fixing the endpoints, 
so their endpoint values are not specified, and the action is not a function 



of them. The equations of motion for p are (3.6), which upon plugging 
back into the action reproduces the original lagrangian action (the transition 
to the hamiltonian is a special case of the fundamental theorem of auxiliary 
variables, see appendixpL Note that adding a total derivative, ^F{q), of any 
function of the g* to the original lagrangian does not change the variational 
principle or the equations of motion, but does change the canonical momenta 
in a way that amounts to a canonical transformation. 
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3.2 Constraints 



Consider now a lagrangian that does not satisfy det {^ Q^iQqj j 7^ 0. This is 

the case for essentially all theories of interest to most high energy physicists, 
with the exception of scalar field theories on fixed backgrounds. The theory 
then has constraints, and the Dirac constraint algorithm or some equivalent 
method must be applied [2^ [23]. 

As an example, consider a theory with a free point particle of mass m, 
labeled by qi, along with a harmonic oscillator of mass M and angular fre- 
quency u, labeled by q2, coupled through a derivative interaction Xqiq2, 

S = jjdt \nql + ]^Mql - ^Mu^ql + Xq^q^. (3.9) 

This lagrangian is free of constraints except when A = ±v^ Mm, in which 
case there is a single primary constraint and a single secondary constraint, 
which taken together are second class. It is easy to see why these values of 
A are special, because for these values we can factor the action, 

S = J^' dt^ {V^Qi ± VMg2)' - ^Mu^l (3.10) 

after which a change of variables q' = \/rnqx ± \fMq2 shows that g' behaves 
as a free point particle while g2, which has no kinetic term, is constrained to 
be zero. 

Varying the action, we find the endpoint term 




(3.11) 



The action is stationary if we choose the quantity \/rnqi ± \/M q2 to be fixed 
on the endpoints. q2 must be chosen to be zero, so choosing this quantity is 
equivalent to choosing qi. 

This illustrates what should be the case in general with constrained sys- 
tems. The variations of the unconstrained variables are fixed on the bound- 
ary, and the constrained variables are not fixed. Each choice of endpoint 
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data for the unconstrained variables determines a choice of data for the con- 
strained variables, which then determines a unique solution to the equations 
of motion. The action is thus a function of the endpoint data for the uncon- 
strained variables. The action should be such that even though all variables 
may appear in the endpoint variation, the number that must be fixed to 
make the action stationary is fewer, equal in number to the number of un- 
constrained degrees of freedom. 



3.3 Gauge invariance 

Gauge invariance also complicates the well-posedness of variational princi- 
ples. Consider the following action for two variables qi and q2- 



Varying with respect to gi gives the equation of motion gi = q2 with no 
endpoint contribution, qi can thus be eliminated as an auxiliary field, and 
plugging this back into the action gives identically zero. The general solution 
to the equation of motion is to let q2 be a completely arbitrary function of 
t, and then set qi = q2- The presence of arbitrary functions in the general 
solution to the equations of motion is the hallmark of a gauge theory, and 
indeed this theory is invariant under the gauge transformation 6qi = e, 5q2 = 
e, where e{t) is an arbitrary function of t. 

Arbitrary functions in the sohition also indicate that the action principle 
is not well defined as it stands. Given endpoint data, we can make a gauge 
transformation that changes the solution somewhere in the bulk, away from 
the endpoints, and so the solution for the given endpoint data will not be 
unique. We salvage uniqueness by identifying as equivalent all solutions 
which differ by a gauge transformation. For example, given any q2{t), we 
can, without changing the endpoint values ^2(^1), ?2 (^2), bring it to the gauge 
q2 — 0, by making a gauge transformation with the gauge parameter e(i), 
obtained by solving e + q2 — subject to the endpoint conditions e(ti) = 
e(t2) = 0. Then, given any choice of 52(^1), ^2(^2), there is a unique solution 
compatible with q2 = 0. 

The gauge condition ^'2 = still allows residual gauge transformations, 
those where e{t) is linear in t. These gauge transformations do not vanish at 




tl 



(3.12) 
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the endpoints. If a gauge transformation does not vanish at the endpoints, 
we must identify endpoint data that differ by such a gauge transformation. 
In the case above, ^2(^1) and ^2(^2) can be set to any desired value by a 
residual gauge transformation, so all the q2 endpoint data is to be identified 
into a single equivalence class. The qi data is constrained, so in total this 
model has only a single state, and carries no degrees of freedom. 

The variational principle is still well-posed. The action is stationary with- 
out fixing the constrained variable qi. The unconstrained variable q2 must 
be fixed, so the action is a function of its chosen boundary values. How- 
ever, gauge invariance of the action ensures that the action takes the same 
value over all the gauge identified endpoint data, so it is well defined on the 
equivalence classesF] 



3.4 Field theory/spatial boundaries 

As an example of a field theory with gauge invariance, we will show how the 
action for electromagnetism is well-posed. Consider Maxwell's equations for 
a vector field A^, to be solved in a region such as that shown in figure [T| 



They are gauge invariant under ^ A^j, + d^K for any function A(x). 

We wish to know what boundary data are required to make this system 
well-posed. Start by fixing Lorentz gauge d^A^ = in the bulk, by choosing 
A to satisfy DA = —d^j_A^. The equations of motion are then equivalent to 



These still allow for a residual gauge transformation A^ ^ A^ + d^A where 
A satisfies DA = in the bulk. Solving the equation DA = requires 

"^In general, gauge transformations may force the identification of other data besides the 
endpoint data. An example is the lagrangian for a relativistic point particle which contains 
reparametrization invariance of the world-line. In this case, it is the time interval t2 — ti 
that is identified [48 . In the case of general relativity, there are gauge transformations 
of both kind. Spatial diffeomorphisms(generated by the momentum constraints) generate 
equivalences among the endpoint data, whereas the diffeomorphisms generated by the 
hamiltonian constraint generates equivalences among the boundary data. 




(3.13) 



□A^ = 0, df.A" = 0. 



(3.14) 
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specifying A on the boundary, so the residual gauge transformations will be 
transformations of the data on the boundary, which will be used to generate 
equivalence classes. 

The wave equation OA^ = requires specifying on Si, S2 and B (ex- 
cept for "unfortunate" choices). Differentiating, we have □(c^^A^) = 0, which 
is the wave equation for the quantity dfj,A^. Thus, if we fix boundary condi- 
tions for then extend into the bulk using OA^ = 0, we need only check 
that d^A^ = on the boundary, as the extension will then automatically 
satisfy 9^v4^ = in the bulk. 

For a given choice of A'^ on the boundary, dfj^A^^ is entirely specified by 
Laplace's equation OA'^ = 0, thus there will be a single constraint on the 
allowed choices of A^ on the boundary to ensure d^A^ = there. To see the 
form of the constraint, note that the perpendicular derivatives of the A^ are 
not set as part of the boundary data, but are determined by extending the 
solution into the bulk. For instance, we can write 

5±^± = /(A±Uv,^llLv)' (3-15) 

where / is some function of the components of A'^ on the boundary specified 
by solving Laplace's equation. Therefore we need only impose the restriction 

[9i,A|j+/(A^,A||)]^^ = (3.16) 

on the boundary data. For every choice of boundary data satisfying this re- 
striction, there is a unique solution to Maxwell's equations in Lorentz gauge. 

We can think of this condition as determining A±, given A\\, so we obtain 
a unique solution given any choice of A\\, with A± determined through the 
constraint. To have a well defined variational problem given this choice of 
the free data, the action should be stationary with only A\\ fixed on the 
boundary. 

We now return to the residual gauge freedom. A gauge transformation 
such that DA = is still permissible. These residual gauge transformations 



leave the constraint (3.15) invariant, and serve to identity the unconstrained 
boundary data into equivalence classes. We can select a member of each 
class by fixing the residual gauge. For example, we can use it to set A± = 0. 
We then have two independent functions that must be specified, namely A\\, 



14 



subject to the single constraint (3.15) . These two functions correspond to 
the two polarization states of the photon. 



Consider now the Maxwell action in this volume 



S= f d". 





(3.17) 



Jv 



Variation gives 




/ 



d^x n„F>"'6A 



(3.18) 



n'^ is the outward pointing normal on the timelike surface B, and the inward 
pointing normal on the spacelike surfaces Si and S2. Notice that the ac- 
tion is stationary when only Ay is held fixed at the boundaries, due to the 
anti-symmetry of F^,^, exactly the data required of the equations of motion. 
Furthermore, we are to able to identify as equivalent any data that differ by 
a gauge transformation because the gauge invariance of the action ensures 
that it is well defined on the equivalence classes. Thus we have a well-posed 
action principle. 



3.5 Adding total derivatives 

Adding a boundary term such as the Gibbons-Hawking- York term amounts 
to adding a total derivative to the lagrangian. We would like to understand 
how adding a total derivative can change the variational problem. The bulk 
equations of motion are always unaffected by adding a total derivative, even 
one that contains higher derivatives of the fields. However, the addition of a 
total derivative may render the variational problem inconsistent, or require 
different boundary data in order to remain well-posed. 

Consider adding a total derivative to the lagrangian Li = of the free 
non-relativistic point particle, so that the action reads 



The lagrangian contains higher derivatives, but they appear only as total 
derivatives, so the equations of motion are still second order. This is analo- 
gous to the Einstein-Hilbert action without the GHY term. 




tl 



(3.19) 
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Varying the action produces the endpoint contribution 



1 

2 



m-q5q)t. (3.20) 



Setting q{ti) and 5(^2), and the associated requirement 5g = at the end- 
points, is no longer sufficient to kill the surface contribution in the variation. 
Fixing q{ti) and g(t2), as well as q{ti) and g(t2), and hence 5q = Q and 5q ~ 0, 
would be sufficient to kill the endpoint term. But this is now setting 4 pieces 
of boundary data for a second order equation of motion, so for most choices 
of data the equations of motion will have no solution. 

One might try to say the following. In the four dimensional space of 
variables that are fixed, parametrized by g(ti), 5(^2), ?(^i), ^(^2), there is some 
two dimensional subspace that is unconstrained. The others are fixed by the 
equations of motion, so the parameters of this subspace are the true degrees 
of freedom of the theory. In fact, there are many such subspaces, i.e. that 
parametrized by q{ti),q(t2), or that parametrized by q{ti),q{t2), etc. The 
essential point is that the action should be stationary when only variations 
of the quantities parametrizing the subspace are held fixed, because we must 
vary over all paths. This is not true of the two subspaces just given. If we 
could find a subspace for which this were true, we could salvage the action 
S2, by saying that the degrees of freedom have been mixed up in the g's and 
g's. 

For example, we might try writing the boundary contribution as 



1 2x f 



t2 

(3.21) 

ti 



and fixing the quantity ^ on the boundary. If fixing endpoint data for q/q 
leads to a unique solution for most choices, then the endpoint variation van- 
ishes upon fixing only this quantity, and we can say that the degree of free- 
dom is encoded in the combination q/q. However, this does not work, since 
the quantities |(ti) and ^(^2) do not parametrize a subspace over which the 
equation of motion is well-posed. We can see this by noting that the general 
solution to the equation of motion is q{t) = At + B, so fixing ^(^2) at the 
endpoints yields 

, ^ ^ = Ci, -r^—r = C2, (3.22) 
Ati + B ' At2 + B ' ^ ' 
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which cannot be solved for A, B given generic values of Ci, C2, ti, ^2- 

In fact, there is no way to find such a subspace for the action based on 
For this reason, if we are given the lagrangian L2, we must add the GHY 
type term (^gg) |*^, to bring the lagrangian back to Li, which is well-posed. 
As a general rule of thumb, if an action contains higher derivatives which 
appear only as total derivatives, a boundary term will need to be added. 

As a final example, which cleanly illustrates why the space of competing 
curves in the variational principle must be kept as large as possible, consider 
the lagrangian L3 = + q . The boundary variation is then q5q + 5qf^_^. We 
are fixing 5g, and we might think we can simply fix g = as well. As long 
as we can fix q to be those values given by solving the equations of motion 
(i.e. q = 0), it is still true that there is a two parameter {q{ti) and q{t2)) 
family of endpoint data, all of which yield a unique solution to the equation of 
motion. We are tempted to think of q as a second-class constrained variable. 
However, fixing q and hence 5q = at the endpoints restricts the class of 
paths over which we are extremizing. Taken to the extreme, we could also 
fix = at the endpoints, and so forth until the entire Taylor series is 
fixed and the only curve competing for the extremum is the solution itself. 
To avoid arbitrariness, we need to keep the space of competing paths as large 
as possible by varying over all paths consistent with the specified boundary 
data. The action based on L3 is not well-posed. 



3.6 Higher derivative lagrangians 

We now turn to an example with higher derivatives that is analogous to the 
F{R) gravity case. Consider the action for a single variable q{t) that involves 

''In general, to set up such a two dimensional subspace we must set functions of q or g 
on the boundary separately, or a function F{q, q) on both boundaries. It is clear that the 
first approach fails to set the above boundary variation to zero. We may imagine functions 
F{q,q) and / such that fSF = ^{qSq — qSq). These would have the desired property that 
setting F — Ci\t^ and F — 6*21*2 parametrize a two dimensional subspace and would imply 
the vanishing of the boundary variation. It is not difficult to show, however, that any such 
F must be of the form F{-) and thus is not a valid choice for the same reason that - is 
not. 
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an arbitrary function F of the second derivative g, which satisfies F" ^ 0, 

Si= r dtF{q), F'VO. (3.23) 
Jtl 

Variation gives the following 

5S, = r dt [F"'{q)q' + F"{qyq] 6q + [F'{q)6q - F"iq)qSq]\ll . (3.24) 
Jtl 

The bulk equation of motion can be solved for the highest derivative '4 in 
terms of the lower derivatives. This is a fourth order equation in standard 
form, and requires four pieces of boundary data to be well-posed. Fixing q 
and q at both endpoints is a valid choice. With this, the variation at the 
endpoints vanishes and we have a well defined action principle for two degrees 
of freedom. 

Si = Si[qi,qi,q2,q2,t]. (3.25) 



Consider now introducing an auxiliary field, A(t), to try and get rid of 
the higher derivatives (this can always be done, a general method is the 
Ostrogradski method [50 | l5T | [9]). 

^2= dt F{X) + F'{X){q-X). (3.26) 
Jtl 

Varying gives 

F'{X)6q - F"{X)X6q 
(3.27) 

This action is well-posed if q and q are held fixed at the boundary, and A 
is kept arbitrary. The equation of motion for A can be solved for A = q, 
which when plugged back into the action yields Si, and hence Si and 5*2 are 
equivalent by the fundamental theorem of auxiliary variables (see Appendix 




The action 5*2 still involves the higher derivatives g, but now they appear 
only through a total derivative, so we integrate by parts, 

^2 = dt F{X) - AF'(A) - F"{X)qX + F\X)q\\\ . (3.28) 
Jtl 



6So 



dt 



F'"{X)X^ + F"(A)A 6q+F"{X) [q - A] 5X+ 
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The integration by parts has generated a boundary contribution. We can 
render the action first order by subtracting this boundary term, that is, by 
adding to S2 the GHY type term — F'(A)g||^ 

The action and its variation now read 

^3= [ ' dt F{X) - XF'{X) - F"{X)qX, (3.29) 
Jti 

F"{X){XSq + q6X) 
(3.30) 

The bulk variation is unchanged because we have only added a total deriva- 
tive, but the boundary variation has changed. We must modify the vari- 
ational principle to keep A and q fixed on the boundary. The action now 
becomes a functional S3 = S'sfgi, Ai, g2, A2, t]. The degree of freedom q has 
has been shifted into A. 



dt 



F"\X)}? + F"(A)A 5q+F"{X) [q - A] 5X- 



Here adding the GHY type term was not strictly necessary to keep the 
variational principle well defined, as was the case in the previous section. 
Here the effect is simply to shift the degrees of freedom into different vari- 
ables]!] 

We are still free to eliminate the auxiliary variable A from the equations 
of motion. Doing so yields the action 

S, = j\tF{q)-j^iF'{q)q), (3.31) 

with variation 

6S,= f'dt [F'\q)V + F\q)T]5q-[F\q)q5q + F'\q)q5q]\^^^^ (3.32) 

The action is now a functional 6*4 = 6*4 [gi, g'l, ^2, The degrees of freedom 
have been relabeled as q and q. 

^This may be true of F{R) theory as well, that is, the boundary term obtained from 
the correspondence to scalar-tensor theory is not necessarily the only one that renders 
the variational principle well-posed. It is, however, the one that must be used if the 
correspondence to scalar-tensor theory is to be maintained at the boundary. 
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The moral of all this is that while adding total derivatives (equivalently, 
boundary terms) to the action does not change the bulk equations of motion, 
it can change the variational principle, and the corresponding labeling of 
degrees of freedom, or it can make it impossible for the variational principle 
to be well-posed. 



4 Review of the Gibbons-Hawking- York term 



The GHY boundary term is a modification to the Einstein-Hilbert action 
which makes the action well-posed. The modified action may be written as 
(conventions and definitions for boundary quantities laid out in appendices 
gandig. 



16nG S = 167iG{Seh + Sghy) = [ (Tx^R + 2 I (r^^x^\K, (4.1) 

Jv JdV 

Where G is Newton's constant. Upon varying the action, we arrive at 



+ 2(b d^'-'xy^lhldK. (4.2) 
Jav 

Here we have used the assumption 6g'^'^ = on dV, which also implies that 
the tangential derivative vanishes on dV, h'^^da^g^p = 0. Noting that 

6K = 5 (/i"^(9,n^ - r^^n^)) 



af3"'^J. 



^h'^^n>'d^6g^f,, (4.3) 



We see that 



16nG 5S = (T'x^g (^R,, - ^Rg.^j 5g^\ (4.4) 

We now have the boundary variation vanishing without any restriction on 
the normal derivatives. However, if this is the only property we desire, the 
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choice of 2K for the boundary term is not unique. We are free to add an 
arbitrary function of the metric, normal vector, and tangential derivatives, 
F{g^j_jj,n^,h'^^dp), because the variation of such an addition vanishes with 
the assumption dg^j^y = on dV. 

In fact, because of this freedom, Einstein unwittingly used the GHY 
boundary term well before either Gibbons, Hawking, or York proposed it 
[53j . He used an object H for the lagrangian, instead of R, 

H = g'-' {T^^K, - r;i,r:;,) . (4.5) 

This is sometimes called the gamma-gamma lagrangian for GR, and has 
the advantage that it is first order in the metric, so there is no need to fix 
derivatives of the metric at the boundary. 

H differs from i? by a total derivative, 

H = R- V„v4", 

where A" = g'^'^T'^^j — g^^V^^,. As such, it produces the same equations of 
motion, i.e. the Einstein equations, upon variation. It also possesses all the 
same bulk symmetries as the Einstein-Hilbert action, namely diffeomorphism 
invariance. Under a diffeomorphism, it does not change like a scalar, but it 
does change by a total derivative. 

We can see that the gamma-gamma action differs from the Einstein- 
Hilbert plus GHY action by a boundary term of the form F{g^y, ra^, h'^^dp), 

Jv Jv 

= [ (Tx^/^R - I (T-^x A'^n^. (4.6) 

Jv JdV 
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Bud 

A'n^ = -2K + 2h'"^dpn^ - n^'N^'d^g^^, (4.7) 

so this is an example of a choice of boundary term that differs from K hy a. 
function F, namely F = 2h°'^dpna — n^h'^'^dyga^. 

The Einstein-Hilbert plus GHY action requires even fewer variables than 
those of the metric to be fixed on the boundary. Only the induced metric 
hab need be fixed. To see this, we foliate in ADM variables with timelike 
hypersurfaces relative to i3, 

167rG Seh = [ (Tx^R = I (Tx f '^-^^i? - /Cab/C^^ + iC 

Jv Jv 

+ 2V« (r^V/3r° - r^V/jr^)] . (4.8) 

The total derivative term, when reduced to a surface term, cancels against 
the GHY term. So the action is 

IQ'kG{Seh + Sghy) = [ rx^/^R + 2 (f r-h^/\h\}C 

Jv Jb 

= [ (Tx [("-'^i? - /Cab/C"^ + IC] . (4.9) 

Jv 

(Here we are ignoring total time derivatives and the GHY terms on the 
endpoints, but a similar cancellation will apply there.) There are no radial 
derivatives of the lapse or shift, so their variation need not be set to zero 
on the boundary. Fixing the induced metric on the boundary is sufficient to 
render the action stationary. 

^In detail, 

= (n-g'^- - n'-g--) ^ [d,g,, + d,g^, - d^g^,] 

= (n^/i^"- - n^^h''-) ^ [d^g,„ + d,g^„ - d^g^,] 

= n^/i^'^r;:, - ^n^g'^'^h-^'^d^g,^ 

= -2K + 2h"'^dpn^ - n''h'"'d^ga^,. 
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For most choices of an induced metric on dV, the Einstein equations 
should produce a unique solution in V, up to diffeomorphisms that vanish 
at the boundary. In this case, the Einstein-Hilbert plus GHY action is well- 
posed. The counting in four dimensions goes as follows. Of the ten pieces of 
boundary data for the ten components of the metric, there are 4 constraints. 
The six components of the induced metric can be taken as the unconstrained 
components. These are subject to equivalence under four gauge transfor- 
mations, leaving two independent pieces of data, corresponding to the two 
degrees of freedom of the graviton. 

5 Higher derivative gravity and scalar-tensor 
theory 

Higher derivative theories of gravity have been studied extensively in many 
different contexts (for a review see |12]). They are invoked to explain cosmic 
acceleration without the need for dark energy [6l [3, EJ |9l [10] , and as quantum 
corrections to Einstein gravity [T3l [T^ . 

One can imagine trying to modify gravity by writing a quite general 
function of arbitrary curvature invariants, of any order in metric derivatives. 



Typically these are true higher derivative lagrangians, i.e. second and higher 
derivatives of the metric appear in a way that cannot be removed by adding 
total derivatives to the action, so the equations of motion are at least fourth 
order. The amount of gauge symmetry is typically unchanged from general 
relativity, i.e. diffeomorphism invariance remains the only gauge symmetry. 
This means that the theories either have more degrees of freedom than general 
relativity, the presence of second class constraints/auxiliary fields, or both. 

Such models are not as diverse as it might seem, since they are essentially 
all equivalent to various multiple scalar-tensor theories [221 [23 [2H1 [211 
[3U[ 131] ■ The transition to scalar-tensor theory amounts to an elimination 
of auxiliary fields represented by some of the higher derivatives of the met- 
ric. They are replaced by scalar fields that more efficiently encapsulate the 




gF{R, R^yR^'^ , R^i^xaR^"'^'^ , • • • , V^-RV^-R, Rfj.uR^xR^" ■, • • •)• 



(5.1) 
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physical degrees of freedom. 



We can get GHY terms for such theories by exploiting this equivalence. 
The equivalent scalar-tensor theory is typically a Brans-Dicke like theory 
in Jordan frame with a potential for the scalar fields, minimally coupled 
to matter. The theory can be brought to Einstein frame by a conformal 
transformation. The boundary term in Einstein frame is just the GHY term, 
so we can find the boundary term for the original higher derivative theory 
by taking the Einstein frame GHY term backwards through the conformal 
transformation and through the scalar-tensor equivalence. The term found 
in this way must be the correct one if the equivalence between the higher 
derivative theory and scalar-tensor theory is to hold for the boundary terms. 

The GHY terms obtained in this way are not generally sufficient to kill 
the boundary variation of the action when only Sg^^ = 0. This is simply a 
reflection of the fact that we are dealing with a higher-order theory. Some 
of the boundary values that the action depends upon involve derivatives of 
the metric, in a fashion exactly analogous to the fourth order toy example in 
section 3.6 The only metric theories where Sg^i, = should be sufficient are 
those with the special property that the equations of motion are still second 
order, despite the appearance of higher order terms in the action, namely 
the Lovelock lagrangians [33] • Indeed, such GHY terms can be found for 
Lovelock theory 



In what follows, we analyze in detail the simplest case, namely the case 
where the lagrangian is allowed to be an arbitrary function of the Ricci scalar, 
F[R), but does not contain any other curvature invariants. The extension 
to more complicated cases follows easily when the scalar-tensor equivalence 
is known. 



5.1 F{R) gravity 

F{R) theory is one of the most widely studied modifications of gravity OH]. 
It has the ability to explain cosmic acceleration without dark energy [HI El El E] 
and to evade local solar system constraints [Bl E] • 
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The action for F{R) gravity is 



S = j (Tx ^F{R). (5.2) 

We would typically add matter which is minimally coupled to the metric, but 
it plays no essential role in the boundary terms, so we will omit the matter in 
what follows. The Euler-Lagrange variation gives equations of motion which 
are fourth order in the metric. 

The equivalence to scalar-tensor theory is seen by introducing a scalar 
field, 0, 

S = j ciW=^[F(0) + F(0)(i?-0)]. (5.3) 

The equation of motion for the scalar is 

F"(0)(i?-0) = O, (5.4) 

which, provided F" ^ 0, implies R = (p. Note that the scalar has mass 
dimension 2. Plugging this back into the action, using the fundamental 
theorem of auxiliary fields, recovers the original F{R) action, so the two are 
classically equivalent. This is = Brans-Dicke theory with a scalar field 
F'{(f)), and a potential. 

In the GR limit F{R) — > R, we have F" — > so the transformation breaks 
down. As the limit is taken, the scalar field decouples from the theory [32] . 



5.2 Boundary terms for general scalar-tensor theory 



In this section we consider a general scalar-tensor action of the form 



V 



S= I rx^{ f{<P)R - ]^\{<P)g^^d,cpdA - U{<P) ] . (5.5) 



We show that this should be supplemented by the boundary term 

SGHY = 2<f d^-'x^/\h\f{4>)K. (5.6) 

JdV 
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We do this by showing that this reduces to the usual GHY term upon con- 
formal transformation to the Einstein frame. 



Once this is done, the equations of motion for the metric are obtained by 
setting dg^j^y = on the boundary, 



2/(0) (^R^y - ]^Rg^)j + 2Uf{ct>)g,y - 2V^V,/(0) = T^t, 



(5.7) 



where 



d^(pdu(p- ^gf,u{d(j)f 



g,Mi<P). (5.^ 



The equation of motion for the scalar field is obtained by setting 50 = on 
the boundary, 

A(0)n0 + ^y(0) (90)' - f/'(0) + /(0)i? = 0. (5.9) 

We now proceed with the conformal transformation, keeping careful track 
of all boundary contributions [IT]. Assuming 

/(0) 7^ 0, 

we can rewrite the action in terms of a conformaly re-scaled metric, 

^,. = [16vrG/(0)]^,,., 1^^°^^ (5.10) 
L JK^Ji y/..> |G'<Oif /(0)<O. ^ ^ 

G can be chosen to be anything consistent with the sign of /, and will 
become the Einstein frame Newton's constant. Re-writing the action is just 
a matter of using the conformal transformation formulae we have collected 
for convenience in appendix O In particular, we see from the second term of 



(C.8), used to rewrite R, that there is an integration by parts that will be 
necessary to bring the scalar kinetic term to its usual form. This will generate 
a surface term which must be combined with the conformal transformation 



of the GHY term ( |5.6[ ). 
The result is, 



S - I V^g [^R - lAim<pO.<p - F(0)) +^ £ d-^.^\K, 

(5.11) 
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where 



^<*' = ISg [-2m ^ —27W) ' 

V(^) = ^li^L^. (5.13) 

|16,rG/(,*)|" 

The GHY term has the usual form in Einstein frame, so working backwards, 
the Jordan frame expression must be correct as well. Variation is done with 
Sgfiu = 0, (50 = on the boundary. 



5.3 Boundary term for F{R) theory 

Adding the boundary term to the scalar-tensor form of the F{R) action, we 
have 



S = [ rf"xv^[F(0) + F'(0)(i?-0)] +2 / r-^x^/\h\F'{(l))K. (5.14) 

Jv JdV 

The variation of the action with respect to the scalar field now contains the 
boundary contribution 

2 / (^~^x^/\h\F"{(P)KS4>, (5.15) 
Jav 

which vanishes since we require 50 = on the boundary. 
Plugging back in we have 

S = [ (Tx ^F{R) + 2 I (r-^x^\F'{R)K. (5.16) 

Jv JdV 



This boundary term has been arrived at before in several different con- 
texts, sometimes indirectly [151 HSl HZl [HI |T9] . However, there has been some 
confusion, because this boundary term is not enough to make the action sta- 
tionary given only dg^j^y = on the boundary. Indeed, there is in general 
no such boundary term with this property [IH]. We see that this is because 
R now carries the scalar degree of freedom, so we must set 5R = on the 
boundary as well. 
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We will now go on to accumulate some evidence that this is indeed the 
correct boundary term. We first calculate the energy in the hamiltonian 
formalism, and show that to obtain the correct energy that reduces to the 
ADM energy when F{R) ~ R, this boundary term must be included. We 
then calculate the entropy of Schwartzschild black holes, and show that in 
order to reproduce the results of the Wald entropy formula, the boundary 
term is necessary. 



6 Hamiltonian formulation and ADM en- 
ergy 



In this section, we will develop the hamiltonian formulation of a general 
scalar-tensor theory, which will encompass the F{R) case. We will stay in 
Jordan frame, and keep track of all boundary terms. Just as in GR, the 
bulk hamiltonian vanishes on shell, and the only contribution comes from 
the boundary, so it will be essential to include the GHY terms found in the 
previous section. 



We start with the action (5.5), which we write as S" = Sq + + Sb, 
where 



Sg 
Sghy 



[ (Tx^ f{^)R, 
Jv 



V 



2 / dT^-^x 



(6.1) 
(6.2) 
(6.3) 



We change to ADM variables [52] , (see appendices A and [B] for conven- 
tions and definitions of the various quantities). The boundary term splits 
into three integrals over the three boundaries. The integral over S2 gets an 
additional minus sign, since by convention the normal should be directed 
inward for spacelike surfaces, whereas for S2 it is directed outward. We will 
suppress the argument of /, g and A over the course of the calculation. 



28 



First look at the combination S — Sq + Sghy, 

Sg + Sghy = / rxy/^ f[^^-'^R - + K^'Kab - 2Va(n^V^n" - 
Jv 

+2 / cT-^z^f^lCf + 2 / cr~^yVhKf - 2 / (f'-^yVhKf. 
Jb /si /s2 

(6.4) 

Integrating by parts the last term in the bulk integral, we find surface con- 
tributions that exactly cancel the boundary integrals over Ei and E2. The 
surface contribution over B does not cancel its corresponding boundary in- 
tegral, and we are left with, 

Sg + Sghy = [ rxVhN [f {^''-'^R - + K^'Kab) + 2f'{n^Vpn'' - n''K)da<p] 
Jv 

+2 £ r-^z^fiJC - Van^Wpn^ (6.5) 
We can simplify the integrand of the boundary piece, 

= (cr^^e^ej - n"n^) V^r„ + n^n^V^jr^ 

= k. (6.6) 
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The bulk terms multiplying /' can be further simplifiecQ 

Putting all this together, we have 

Sg + Sghy = [ rx VhN[f{^'^~'^R-K^ + K'''Kab) 
Jv 

+2 / (T-^z^Nfk. 

B 



We now specialize to the {t,y"') coordinate system, in which ria 
= The term h°"^n^df3ni^ vanishes. We are left with 

Sg + Sghy = [ d^xVh [Nf -K' + K'^'Kab) 

Jv 

+2 j (T-^z^Nfk. 



^In detail, 
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The scalar action, in ADM variables, is 



S„ 



hX 



2N 



-NVhU. 
(6.10) 

Define now 

S'c = I (Tx^ [Nfl^'^-^^R -K^ + K^^Kab] + 2fh''^daNdk4)) (6.11) 

(6.12) 
(6.13) 

S = S'a + S'+S's. (6.14) 



S'^ = S^-2 I rxv^f'Ki^ - N'^da^ 
Jv 

S'b = 2^ (T-^z^Nfk, 
Jb 



The action is now in a form amenable to transition to the hamiltonian, 
namely it is in the form of a time integral over a lagrangian, L, (which is 
itself a space integral plus boundary parts) containing no time derivatives 
higher than first, and no boundary contributions at the time endpoints. 



S 



t2 



dt L 



hab, Kb, N, AT'^, 



(6.15) 



Note that this would not be the case were it not for the GHY term on 
the surfaces Si and S2. It now remains to transition to the hamiltonian 
formulation. This has been done without keeping surface terms by [371 138] , 
and at the level of the equations of motion by [39]. We start by finding the 
canonical momentum conjugate to 0, 



6L 



h\ 



N 



(^0 - N'^dacj)'^ - 2VhfK. (6. 16) 



To find the canonical momenta conjugate to hab, we vary with respect 
to Kab, then use the relation Kab = ^ ( hab — VaA'5 — VbNa ) to replace 
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ab 



P 



6L 



6k 



ab 



Vh 



(6.17) 



Equations (6.16) and (6.17) are invertible for and K (which is essentially 



hab) in terms of p^, p"^ (and 0, hab, N, A^"), 

^ ^ N r in-2)fp^-2fp 

h L2(n-l)/'2 + (n-2)/Aj 



K, 



ab 



1 Pab 



h. 



ab 



p^f + 2p^ + p\ 
2{n - l)/'2 + {n- 2)/A 



(6.18) 



where p = h°'^pab- The canonical momenta conjugate to and A^" both 
vanish, just as in GR. 

Notice that the map from velocities (0, hab) to momenta {p^, pab) is non- 
singular even when the scalar kinetic term vanishes, A — 0, as in the case 
of theories equivalent to F{R). This corresponds to the fact that the scalar 
is still dynamical in this limit, by virtue of its non-minimal coupling. The 
case / — s> const 7^ is also well behaved, provided A 7^ 0, as the scalar has 
dynamics stemming from the kinetic term. The case / const and A — 
is indeed singular, because the scalar field then loses its dynamics. 

We now start the calculation of the hamiltonian. Starting with the scalar 
field part, we must express everything in terms of the fields and momenta, 
by eliminating all time derivatives, 



H' 



-L' 



,K 



A 



(6.19) 
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Here we must treat Kat as the function of p"'', hab, and p^f, given by (6.18). 



Naively, the above appears singular in the limit A ^ (the F{R) case), 
however once we re-express Kab in terms of the momenta, the 1/A terms 
cancel and the limit is smooth. 



Now the metric part. 



G 



'G 



+2fh'''daNdb'f) . (6.20) 

Integrate by parts to pull all the derivatives off of and A^", being sure to 
keep the boundary contributions. 



H' 



G 



ab 



2N^Kab - Nf (("-^)i? -K' + K'^'Kab) 
yh 



+ 2NV\fVa<P)-2NaS/b 



p 



ab 



+2 j d^'-^e^ Ta [^b^ - Nfd^ct!^ 



(6.21) 



The boundary term in the lagrangian contributes 



H'b = -L'b 



-2 (h d'^-'e^Nfk. 

'St 



Combining these terms yields the full hamiltonian 

= [ d^'-^yVhlNf {-^''-'^R- + K^^Kab) 
Jut 

1 pI , 



p 



+N { ^Ka, + 2V"(rV.0) + -'^ + ^ + 2^h''9,<l'9,<p + U 



ab 



+2 (h d''-^e^ 

'St 



ab 



r,N,^-N{fk + r-fda^ 
h 



(6.22) 
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This hamiltonian is like that for GR in the sense that the equations of 
motion for and A^" are Lagrange muhiphers which cause the bulk terms 



to vanish. We have not bothered to write out Kab using (6.18) because it 
because it does not contain N or iV". 

The hamiltonian evaluated on solutions reduces to the boundary part 

Hsoiuuon = 2 i d^-^Oy^raNb^ - N{fk + r-fda<p). (6.23) 
J St yh 

The ADM energy, is given by choosing the lapse to vanish and the shift 
to be unity, 

E = -2(f dJ'-'^d [fk + fr^'dact)] ■ (6.24) 

J St 

Alternatively, we could also have obtained this expression by finding the 
hamiltonian in Einstein frame and then performing a conformal transforma- 
tion. 

As an example, consider the Schwartzschild solution in four dimensions, 

ds^ = -f(r)df + -j^dr^ + r^du^, = 0o, (6.25) 
fir) 

where /(r) = 1 — GM is a constant, and 0o is a constant. This will 



be a solution to the scalar-tensor theory equations of motion (5.7), (5.8) 



and (5.9), provided that Minkowski space is a solution, and 00 is set to the 
vacuum value of in the Minkowski solution. The ADM energy for this 
solution i^ 

E = 167rG'M/(0o), (6.26) 

which is what one expects, given that /(0o) is playing the role of the effective 
gravitational constant, /(0o) 



1 



167rGcff ■ 



F{R) theory is the special case where /(0) = F'{(j)), U{(f)) = F'{ 
F(0), A(0) = 0. In order to hamiltonize F{R) theory, it must first be brought 
to first order form (this is essentially the content of the Ostrogradski method 
for hamiltonizing higher order systems [HOI E])- Passing to the scalar- 
tensor description is the simplest way to do this, so we have also found the 



^Recall that we must measure the energy relative to the energy of Minkowski space, 
which is the vacuum solution. The energy of each is individually divergent as r ~^ oo, but 
the difference is finite and yields the above expression. 
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ADM energy for F{R) theory. The boundary term must be passed in the 
same way, and so plays the same essential role. 



7 Black hole entropy 

When calculating the entropy of a black hole in the euclidean semi-classical 
approximation, it is essential to have the correct GHY term [33]. In fact, 
this term is responsible for the entire contribution to the euclidean action. 
In this section we will calculate the entropy for a Schwartzschild black hole 
in F{R) theory using our boundary term, and compare this to the entropy 
given by the Wald entropy formula. 

The Wald entropy formula allows one to calculate the entropy of a black 
hole in any diffeomorphism invariant metric theory of gravity. It involves 
an integral over the bifurcation two sphere of the horizon of the black hole 
POl 1^ . The formula does not rely on having a well-posed action principle, 
i.e. it depends only on the lagrangian density, and hence the GHY terms 
are not needed in order to apply it. For a Schwartzschild black hole in F[R) 
theory in four dimensions, the Wald formula gives the result |33] 

Sbh = jlQTrF'iRo), (7.1) 

where A is the area of the horizon and Rq is the (constant) background 
curvature of the spacetime the black hole sits in. Wald has shown that his 
entropy formula gives the same value as the euclidean semi-classical approach 
for theories which satisfy several conditions in addition to those needed by 
the formula itself j42j. One of these additional conditions is that there be 
a variational principle for the action where only the metric is held fixed on 
the boundary, so this does not cover the general F{R) case. Nevertheless, 
we will see that our surface term still gives the entropy in agreement with 
Wald's formula. 

The partition function in the semiclassical limit is 

Z[/3]=e-^-, (7.2) 

where P = ^ is the inverse temperature and Se is the euclidean action of the 
dominant classical field configuration where the time variable is identified 
with period f3. 
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In Einstein gravity in four dimensions, we have the Schwartzschild solu- 
tion 

ds^ = -f{r)de + -^dr^ + r^duj\ (7.3) 

where /(r) = 1 - and M is the ADM mass. 

The temperature can be determined by finding the corresponding solution 
to the euclidean action, which amounts to taking t —>■ ir, and then finding 
the period of r required to eliminate the conical singularity at the horizon. 
The resulting metric is then 

dsl = f(r)dT^ + -^dr^ + r^du\ t = t + P, (3 = MGM. (7.4) 
fir) 

The temperature of a black hole is model-independent, since no use of the 
Einstein equations was made other than the fact that Schwartzschild is a 
solution. Thus in any theory of modified gravity where Schwartzschild is a 
solution, the black hole will have the same temperature. 

We first review the euclidean action calculation in GR, since the extension 
to F{R) is then trivial. The euclidean action for GR is 

IQtiGSe = - [ d^x ^/\g\R -2 I d^x VM(^ - ^o)- (7.5) 

Jv JdV 

The action for a general theory must be zeroed on the action for the back- 
ground one is expanding about, i.e. the vacuum state. The true, finite action 
is thus S — So, where 5*0 is the action of the background. In our case, the 
background is fiat space. The bulk contribution to Sq vanishes, and all that 
remains is Kq, the extrinsic curvature of the boundary as measured in fiat 
space. The term Kq is often called the boundary counterterm (misappropri- 
ated from the quantum field theory jargon) and may in general have to take 
a more complicated form ^9]). 

Since the Ricci curvature of Schwartzschild vanishes, the bulk term does 
not contribute to the classical action. The entire contribution comes from 
the boundary term. Taking the boundary to be a sphere of radius r about 
the origin, we have 

(f d^x ^/\h\K = ^n(3{2r-3GM). (7.6) 

JdV 
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For the background, we periodically identify the time to the period /3, red- 
shifted by the factor (l - ^)^''^ 

j) (fx vT^Xo = 87r/3r f 1 - j . (7.7) 
Taking the difference and the limit r — > oo yields 

The free energy, entropy, and energy are then 

E = ^ + (7.10) 

where A = A'K{2GM f is the area of the horizon. 

We now extend the calculation to F{R) theory. The bulk vacuum equa- 
tions of motion are 

F'{R)R,, - \F{R)g^, + g^.V^F'{R) - V^V,F'{R) = 0, (7.12) 

which are fourth order in the metric, as expected. AdS/dS Schwartzschild 
has the property R^^,, = jQ^v, with R = Rq a constant. Using this ansatz 
the equation of motion reduces to 

F'{Ro)Ro - 2F{Ro) = 0, (7.13) 

which is an algebraic equation for Rq. This is the same equation one would 

obtain seeking constant curvature solutions, so for every constant curvature 
background, wc also have a Schwartzschild black hole that asymptotically 
approaches this background. 

The euclidean action, including the GHY term, is 



Se + So — — 



[ d^x ^/\g\F{R)-2 (f d^x^\F'{R)K. (7.14) 

Jv JdV 
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Here 5*0 is the action for the background. Since the Schwartzschild solution 
has the same constant curvature as the background, the bulk contribution 
vanishes, and the entire contribution again comes from the boundary term, 

Se = -2F'{Ro) (f d^x^/\h\{K - Ko) . (7.15) 

JdV 

where Rq is the scalar curvature of the background/black hole. 

If we assume for simplicity that F{R) is such that there is a fiat space 
solution, then the calculation proceeds just as in the GR case, with the result 

Se = P'F'{Ro). (7.16) 

The thermodynamic quantities are then 

F = PF\Ro), E = 167iGF'{Ro)M, S = -167rF'(i?o)- (7.17) 



These formulae make good sense from the point of view of the scalar- 
tensor theory. The scalar field is just R, the Ricci curvature, so the Schwartzschild 
solutions we are considering have constant scalar field everywhere. This is in 
accord with no hair theorems, which forbid non-trivial scalar profiles around 
black holes in space with zero or positive curvature ^51 H6l W7\ . The value 
of the effective Newton's constant is set by the constant value of the scalar 
field 

TTT^ = F'{Ro), (7.18) 

so the entropy is simply one quarter the area of the horizon in units of the 
effective Planck length. The energy is the ADM energy we found in the 
previous section. 

An interesting consequence of this formula is that higher curvature terms 



make no correction to the entropy. From (7.13) we see that if i? = is to be 



a solution, then -F(O) = 0, so F can be taylor expanded around the origin, 

starting with the Einstein-Hilbert term: F{R) = F'{0)R + |F"(0)i?2 H . 

All the higher power corrections to the action do not affect the entropy. In 
particular, the entropy of a black hole in pure i?" gravity, for n >2, vanishes. 
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8 Conclusions 



Having correct boundary terms is essential to the consistency of any theory. 
One can generally get away without them, but there are instances where they 
are vitally important. We have argued that consistent GHY terms for higher 
derivative modified gravity theories can be obtained by using any scalar- 
tensor equivalence the theory may posses. The boundary terms obtained 
in this way, while not necessarily unique, do give a well-posed variational 
problem and give the expected ADM energy and black hole entropy, even 
though derivatives of the metric may have to be fixed on the boundary. 

What we have given is by no means a complete analysis of boundary 
terms, however. For a general lagrangian of any kind of field, not necessarily 
a modified gravity theory, it is far from clear whether there always exists a 
boundary term that renders the variational principle well-posed. Further- 
more, even if such a term can be found, it is not evident what freedom is 
allowed in choosing the boundary term, or what physical significance this 
freedom entails. 
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A The ADM decomposition 

Here we review the ADM hypersurface decomposition of spacetime, and lay 
out our conventions in the process. The conventions and notation are those 
of In this appendix we will describe a generic foliation of a volume 

V, by spacelike or timelike hypersurfaces, and in the next appendix we will 
describe the specific foliations we use throughout the paper. 

Put coordinates on V. We foliate the volume V with hypersurfaces 
by giving a global time function t{x'^) and declaring the hypersurfaces to be 
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its level sets. We then set up functions y°'{x^), a — 1 . . .n — 1, independent 
of t{x^) and each other, to serve as coordinates on the submanifolds. Taken 
together, t and y"' form a new coordinate system on V, and we have an 
invertible transformation from the these coordinates to the old ones 

x^{y\t), t{x^), y\x^). (A.l) 



The coordinate basis vectors of this new coordinate system are 



The dual one forms are 



f - — e « - ^ f A 3) 



They satisfy the duality and completeness relations 

t%^\ e\e^^hl t'^e;^e^J,^0. (A.4) 
tH, + e^,~ej^ = 5^,. (A.5) 

Introduce a bulk metric g^i,. There is now a well defined one-dimensional 
normal subspace at each point of S^, which may be different from the sub- 
space spanned by t^. We set up a basis consisting of the forward pointing 
unit normal vector along with the e^^. The e'^^ are not required to be 
orthonormal among themselves, but are orthogonal to n^. We have 

g^.n^'rf^e, ^^.e'^X = 0. (A.6) 

Here e is defined by 



1 timelike 
-1 Tit spacehke 



(A.7) 



We define the associated dual forms e^, hfj,, at each point, 

nX = l, eV; = ^.^ n'^e; = e^X = 0. (A.8) 
n^n, + e'^^e,'^ = 5^. (A.9) 
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(We use for the dual one form, and reserve for the form gfxnvy . They 
differ by a sign for spacelike hypersurfaces, i.e. = ^^1^.) 

A vector field A'^ is called parallel if it admits the decomposition — 
A"e^^. A form is parallel if it admits the decomposition A^ = AaC^. 
Similarly, a general tensor is parallel if it admits a similar decomposition, for 
example, a (2, 1) tensor is parallel if 

There is a bijective relation between tensors on the submanifold (really a 
one parameter family of tensors, one on each surface, parametrized by t) and 
parallel tensors in the bulk. Given a parallel bulk tensor A^'^^, it corresponds 
to the submanifold tensor A"'^^, and vice versa. 

Define the projection tensor 

P^^^ ^5ii- rfh,. (A.ll) 

It projects the tangent space of V onto the tangent space of E^, along the 
subspace spanned by n^. It satisfies 

P\P\ = Pt, (A. 12) 

ne\ = e^,, P>^ = (A.13) 
Pte; = e/, P>, = 0- (A.14) 

Given any bulk tensor, e.g. Ai^"^, we can make a parallel tensor by pro- 
jecting it, 

j^W^u^ = P^'^P'^^P^A''^. (A.15) 
A tensor is parallel if and only if it is equal to its projection. 

We have the relation 

eV." = n- (A. 16) 

Projecting the metric gives the induced metric h on the hypersurfaces, 

h^u — P^^P i^Qpiy hah&ix 1 hab ^^a^ ^^a^ b9nv (A- 17) 
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We raise and lower bulk indices z/ . . . with (yf^,^ and its inverse g'^'^, and we 
raise and lower submanifold indices a,b, . . . with hab and its inverse /i"^. In 
particular, we have, 

}f^g^^.e\ = e/, g„^.rf = n^^ eh^, (A.18) 

/iab^^V = e'^^, ^'^'^n. = n'^ = en^ (A.19) 

as well as 

9,aP% = K., g^^P\^h>^r (A.20) 

To perform the ADM decomposition, we want to write the bulk metric in 
the (?/",t) coordinate system. We start by expanding the coordinate vector 
over the new basis, 

= Nni" + A^"e^„. (A.21) 

The coefficients and A^" arc called the lapse and the shift, respectively. 
We have 

if, = d^t = e-n^, = eNt^ = eNd^t. (A.22) 

The coordinate one-forms are 

dxf" = t^dt + e^Jy" = Nn^dt + {N^dt + dy") e^^. (A.23) 

The metric is 

g^^dx^dx"" = eN^dt + hab {N^dt + dy") {N''dt + dy^) . (A.24) 
In matrix form this is, 

■ Ar„ 



eN^ + N^Na 

9 = ■ 



h 



ah 



(A.25) 



The inverse metric is 

9' 



(A.26) 



The square root of the norm of the determinant is 

V\9\ = N^\. (A.27) 
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The normal vector is 



1 A^" 
n° = -, n'' = - — , no = eN, = 0. (A.28) 



The extrinsic curvature is a parallel tensor defined by 

Kab = e^,e\V^n,. (A.29) 

It is symmetric, 

Kab = Kba, (A.30) 

as can be easily shown by noting that the basis vectors have zero Lie bracket, 
e'^feV^e^^ = e^'^V^.e^ft. We also have 

Kab = V(^n^)e^^e\ = ^e'^^e^Cngi.u, (A.31) 

where jC is the Lie derivative. The trace of the extrinsic curvature is given 
by 

K = h'^^'Kab = V^n^ (A.32) 
In terms of ADM variables, we have 

Kab = ^ [Kb - VaNb - VbNa) ■ (A.33) 

We often use the decomposition 

R = + e{K^- KabK"^) + 2eVa (n'^V/jn" - n^K) . {AM) 

Here R is the Ricci scalar constructed from g^^, and is the Ricci scalar 

constructed from the induced metric hab- 



B Foliation of spacetime 



Throughout this paper it is necessary to refer to spacetime, boundaries of 
spacetime, foliations of spacetime, and all of the geometrical quantities that 
these induce. As such it is necessary to layout some standard terminology to 
describe the situation. See figure [2j 
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Figure 2: Foliation of spacetime 



Spacetime will be referred to as V bounded by dV. dV has three parts, 
a timelike boundary B, a spacelike initial surface Ei and a spacelike final 
surface S2 . We foliate the volume V with hypersurfaces by giving a 
global time function t{x) and declaring the hypersurfaces to be its level sets. 
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The top and bottom surfaces Si and E2 are to coincide with Ej^ and S^j, for 
some times ti and t2- The normal vector to the is denoted and is future 
pointing. The coordinates on are j/". The lapse and shift relative to this 
fohation are N, A^". The induced metric is hab and the extrinsic curvature is 

Kab. 

The boundary B can be thought of as part of a foliation by timelike 
surfaces. The coordinates on B are z*. The radially outward pointing normal 
vector is r^. We demand that the surfaces E^ intersect B orthogonally, so 
that g^uT^^n^ = on i5. This implies that is parallel to E^ on B, so that 

= r^-e^^ there, for some r". The lapse and shift relative to this foliation 
are A/", A/"*. The induced metric is 7jj, and the extrinsic curvature is /Cjj. 

The intersections of E^ with B arc denoted = E^ fl B. These form a 
spacelike foliation of B. The coordinates on St are 9^^ the induced metric is 
(Tab, and the extrinsic curvature of St as embedded in Ej is kAB- 



C Conformal transformation formulae 

Here we collect some formulae on conformal transformations of the metric. 

Qnv = u;'^{x)g^^, u;{x) > 0. (C.l) 
-gf''' ^uj-'^{x)gi'r (C.2) 

The connection transforms as 

= {^d,^ + 5id,uj - g.^g'^dau) . (C.4) 

The curvature scalar transforms as 

R = uj-'^R - 2{n - l)^"^cu-3V„V^cu - (n - l)(n - 4)g'"^(jj-^da(jjd^(jJ. (C.5) 
Some convenient covariant derivative transformations are 

V^V,0 = V^V,(/. - (5^5^ + 6li6': g,,g^^) uj-^d^ujdpct>. (C.6) 

□0 = uj-'^n^ +{n- 2)g'^^uj-^do,ujdi3(l). (C.7) 
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The inverse transformations are 

R = u'^R + 2{n - 1)^°^cjV„V/3W - n{n - l)g''^daUjdpu. (C.8) 

= + (5^5^ + 6^^6: - -g^^r") uo-^d^oodp^p. (C.9) 

□0 = u'^U(t)-{n- 2)g''^ujdaUjdp(f). (C. 10) 

The various 3+1 dimensional quantities transform as follows 

hab = uj^hab, iV'^ = iV^ N = ujN, (C.ll) 

}f^ = UJ-^h^\ Na = UJ^Na, (C.12) 

h^ = uj~^n^, h^ = ujn^, (C.13) 

kab = UjKab + habUf'd^.UJ, (C.14) 

K = uj~^K + 00-^71-1)71" df,uj. (C.15) 



D The fundamental theorem of auxihary vari- 
ables 

Here we recall some facts about auxiliary fields in classical field theory. For 
more, see for example [Ml 123] . 

Suppose the lagrangian depends on two sets of fields 0* and x"^; = 
£([0], [x]) 2;), (here [ ] stands for dependence on the fields and any of its 
spacetime derivatives of arbitrary but finite order) and that the equations of 
motion for can be solved in terms of the 

Plugging these relations back into C, we get a lagrangian depending only on 
the 0*, which we call C, 

cm,x)^cm,[xm,x)],x). (d.2) 
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We will christen the following the fundamental theorem of auxiliary vari- 
ables. 



• The equations of motion derived from L and L are equivalent in the 0* 
sector, i.e. if ^^{x) is a solution to ^-j^ = 0, then it is also a solution 

to ^^i^ = 0, and vice versa, if is a solution to = 0, then it 

is also a solution to = 0, with the extension to the given by 

the obtained from solving the equations of motion. 

Proving this is a matter of convincing yourself that you can extremize a 
function of several variables either by extremizing with respect to all the 
variables, or by first extremizing with respect to a few, and then with respect 
to the rest. 

The other important property that hold when eliminating auxiliary vari- 
ables is that both the global and gauge symmetry groups of C and £ are 
the same. One does not lose or gain symmetries by eliminating or adding 
auxiliary variables. 

Notice that in the fundamental theorem, there is no requirement that the 
auxiliary fields be solved for algebraically. However, when this is not the 
case, one must be careful about boundary contributions. To illustrate this, 
wc reprint here a nice example taken from the exercises of chapter one of 
[23]. 

Consider the action for two variables q{t) and A{t) with values fixed on 
the endpoints. 

The equations of motion for q and A are, respectively. 



d A\ , , 

Notice that the A equation implies A = cq"^, where c is a constant of integra- 
tion. Plugging this back into the action, we obtain 

S = dt ^ [e + c'q^) , (D.5) 
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which gives naively for the q equation of motion q — c^q = 0. Looking back 
at the original q equation of motion, and plugging in A = cq"^, we obtain 
q + c^q = 0, a contradiction for 7^ 0. 

It is often said that auxiliary variables can only be eliminated if their 
equations of motion can be solved algebraically, and "counterexamples" like 
this are quoted to illustrate this. However, if we are careful to take into ac- 
count the endpoints, we can resolve the problem. Consider again the equation 

of motion for A, ^ j = 0. We must solve this subject to the endpoint 

conditions A(ti) = Ai, A{t2) = A2. Integrating both sides of the solution 
A = cq"^, we find that the constant c actually depends non- locally on q, as 
well as on the endpoint data for A, 

c= f--^' . (D,6) 



Plugging this more careful result into the action, we obtain a non-local 
action for g. 

Varying this carefully with respect to g, we recover the correct (non-linear, 
non-local) equation of motion q + c^q = 0. 
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